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ABSTRACT 

DNA double-strand breaks pose a significant 
threat to cell survival and must be repaired. In 
higher eukaryotes, such damage is repaired effi- 
ciently by non-homologous end joining (NHEJ). 
Within this pathway, XRCC4 and XLF fulfill key 
roles required for end joining. Using DNA-binding 
and -bridging assays, combined with direct visual- 
ization, we present evidence for how XRCC4-XLF 
complexes robustly bridge DNA molecules. This un- 
anticipated, DNA Ligase IV-independent bridging 
activity by XRCC4-XLF suggests an early role for 
this complex during end joining, in addition to its 
more well-established later functions. Mutational 
analysis of the XRCC4-XLF C-terminal tail regions 
further identifies specialized functions in complex 
formation and interaction with DNA and DNA 
Ligase IV. Based on these data and the crystal struc- 
ture of an extended protein filament of XRCC4-XLF 
at 3.94 A, a model for XRCC4-XLF complex function 
in NHEJ is presented. 



INTRODUCTION 

DNA double-strand breaks (DSBs) pose a serious threat 
to genomic stability. DSBs result from exposure to ex- 
ogenous sources such as ionizing radiation, but can also 
result from normal cellular processes including intermedi- 
ates of both V(D)J recombination and class switch recom- 
bination in developing lymphocytes (1,2). Multiple 
mechanisms exist to repair DSBs but in mammalian 
cells, Non-Homologous End Joining (NHEJ) is the pre- 
dominant pathway utilized to repair two-ended DSBs. The 
core set of proteins required for NHEJ includes the Ku 



heterodimer, the catalytic subunit of the DNA-dependent 
protein kinase (DNA-PKcs), XRCC4, XLF/Cernunnos 
and DNA Ligase IV. Ku initially binds DSBs and 
recruits DNA-PKcs, generating the DNA-PK complex 
(3). Synapsis of two DNA-PK complexes regulates, via 
autophosphorylation, processing of damaged DNA ends 
by Artemis and other DNA modifying enzymes (4-7). 
XRCC4 DNA Ligase IV can then ligate the two DNA 
strands together. XLF directly interacts with XRCC4 
and stimulates XRCC4 DNA Ligase IV joining of DNA 
ends although the mechanistic basis for XLF's stimulatory 
effect is not understood (8-10). Both XLF~~ and 
XRCC4 -- mammalian cells exhibit increased sensitivity 
to ionizing radiation coupled with serious defects in DSB 
repair (11,12). Furthermore, XRCC4 deficiency in mice 
results in embryonic lethality (13). Although humans 
with XLF deficiency display growth retardation, and im- 
munodeficiency (14), XLF deficiency in mice results in 
only a modest immunodeficient phenotype with no effect 
on growth (15). Alt and colleagues (16) have recently 
shown, however, that mice deficient in both XLF and 
ATM have a much more severe NHEJ phenotype consist- 
ent with functional redundancy of XLF and ATM and 
suggest that like ATM, XLF may also facilitate stabiliza- 
tion or bridging of DNA ends at a DSB. 

Still, how an XRCC4-XLF complex functions during 
repair is not well-understood. Although they interact with 
one another, only XRCC4 interacts directly with DNA 
Ligase IV even though XLF stimulates ligation of 
non-cohesive DNA ends (8,10,17). Together these 
findings have led to the idea that XRCC4 and XLF act 
late during NHEJ, in particular promoting the final 
ligation step. More recently, however, live-cell imaging 
demonstrated that recruitment of XLF and XRCC4 
to damaged nuclear sites is rapid and dependent on the 
presence of Ku but not DNA-PKcs (18,19). Furthermore, 
Roy et al. (52) report that perturbations of the XRCC4- 
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XLF interaction result in DSB repair deficits including 
reduced frequency of coding-end joining during V(D)J re- 
combination but not signal-end joining. Taken together, 
these data suggest that XRCC4 and XLF may have 
multiple functions during NHEJ, including their 
well-characterized late step in ligation and also an earlier 
step that may be independent of DNA Ligase IV. 

Individual crystal structures of XRCC4 and XLF reveal 
their structural similarity, forming homodimers with 
nearly identical N-terminal head domains and long 
a-helical tails (15,20,21). Mutational analysis within the 
head domains identified residues R64, L65 and LI 15 of 
XLF, and K65 and K99 of XRCC4 as critical to XRCC4- 
XLF complex formation (20,22). Given the location of 
these residues and the dimeric nature of the proteins, it 
was proposed that XRCC4-XLF could form extended 
filament-like structures (20,22). This model has gained 
further support through recent structural analyses 
demonstrating that filaments of alternating XRCC4- 
XLF dimers can be formed in vitro (23-26). The functional 
significance of these structures is not yet under- 
stood. Here, we determine a role for XRCC4-XLF 
independent of DNA Ligase IV in bridging DSBs. 
XRCC4-XLF complexes are also shown to exist in two 
distinct states, one of which mediates DNA bridging. We 
further identify regions of XLF and XRCC4 important 
for DNA binding and demonstrate that these regions 
make independent contributions to XRCC4-XLF 
function in DNA bridging. In particular, the C-terminal 
tails of XLF are essential for XRCC4-XLF complex- 
DNA interactions, while the corresponding region of 
XRCC4 appears to regulate formation of the nucleopro- 
tein complex required for DNA bridging. Sequestration of 
XRCC4 C-terminal tails through binding of DNA 
LigaselV tandem BRCT domains disrupts this species, 
ablating DNA bridging. Based on these data and the 
crystal structure of an extended protein filament of 
XRCC4 XLF at 3.94 A, a model for the mechanism of 
XRCC4-XLF association within NHEJ is presented. 

MATERIALS AND METHODS 

XLF and XRCC4 expression vector construction 

All XLF and XRCC4 expression plasmids, with the ex- 
ception of XRCC4 1 15? , were from previously described 
work (20,21,27). See Supplementary Data for further 
details. 

Protein expression and purification 

XLF, XRCC4 and BRCT domains, wild-type and all mu- 
tations, were expressed and purified as previously 
described (20,21). XRCC4 1 " 157 was expressed in M9 
SeMET growth media kit (Medicilon Inc.). 

Electrophoretic mobility shift assays for DNA binding 

DNA preparation is described in Supplementary Data. 
Binding reactions (20 pi) contained lOOng of DNA, 
20 mM HEPES pH 8, 0.5 mM EDTA, ImM DTT, 
5% glycerol and 120 mM KC1 after addition of proteins 



at the indicated concentrations. Reactions were incubated 
at room temperature for 30min and resolved by electro- 
phoresis in a 0.8% agarose gel in Tris-borate buffer 
(89 mM Tris-base, 89 mM boric acid, 2mM EDTA pH 
8.3) at 80V/15cm for 60 or 90min. Gels were stained in 
Tris-borate buffer supplemented with 0.5 ug/ml ethidium 
bromide (30min) and destained in deionized water (2h). 

DNA bridging 

DNA bridging was carried out as illustrated in Figure 2A. 
Full details are provided in Supplementary Data. 

Scanning force microscopy 

DNA substrates and XRCC4-XLF complexes were 
prepared as described in Supplementary Data. Images 
were obtained on a NanoScope IV SFM (Digital 
Instruments; Santa Barbara, CA) operating in tapping 
mode in air with a type E scanner. Silicon Nanotips 
were from Digital Instruments (Nanoprobes). The length 
and height measurements were done with NanoScope 
software v 5.12 (Digital Instruments; Santa Barbara, 
CA, USA). Although absolute dimensions in SFM are 
distorted by tip convolution, relative size and separation 
between objects can be used very accurately. 

Crystallization and data collection of 
XRCC4 1157 -XLF'- 224 

Crystals were grown using the hanging drop vapor diffu- 
sion method. A 1 pi protein solution (50 pM XLF 1-224 : 
100 uM XRCC4 1 " 157 in 20 mM Tris, pH 8, 200 mM 
KC1, ImM EDTA, 10 mM DTT and 10% glycerol) was 
combined with microseeded crystallization solution (0.8 pi 
of 1.8 M tri-ammonium citrate, pH 8) and additives (0.2 pi 
each 0.1 M barium chloride dihydrate and 2.0 M sodium 
thiocyanate). Crystals were dehydrated over 4M 
ammonium sulfate, pH 7, prior to a 3-h soak in phasing 
solution before freezing (1 pi 0.5 mM tantalum bromide 
and 0.5 pi of 60% PEG 8000). Full crystallization details 
are described elsewhere (28). Diffraction data were col- 
lected at NSLS, X25 (Brookhaven, NY, USA). 
Coordinates for the structure have been deposited to the 
PDB, accession code 3RWR. 

Structure determination and model refinement 

Data was processed with HKL2000 (29). Phases were 
determined using MR-SAD, followed by density modifi- 
cation with Phenix. An XRCC4-XLF docking model was 
used as a search model (22). Eight tantalum bromide sites 
were found. Iterative rounds of model building and refine- 
ment were carried out using Phenix, Wincoot and CNS 
vl.3 with DEN refinement (30-33). DEN refinement was 
chosen based on its ability to refine lower resolution struc- 
tures. Interacting regions were analyzed using ZMM (34). 
All structural figures were generated using PyMol (The 
PyMOL Molecular Graphics System, Version 1.2r2, 
Schrodinger, LLC). Buried surface area calculations 
were carried out through the PISA server, v. 1.1 8 (35). 
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RESULTS 

Identification of regions required for XRCC4-XLF 
complex-dependent DNA binding 

Here we have localized the DNA-binding interface of 
XRCC4 and XLF to residues 157-200 and 224-299, 
respectively (Figure 1A, lane 3 and Figure IB, lane 3). 
Further mutational analysis within XRCC4 identified 
amino acids El 70 and R192 as necessary for DNA 
binding (Figure 1A, lanes 4-5). These residues fall in 
a highly conserved segment of XRCC4 also responsible for 
homo-tetramerization and interaction with DNA Ligase 
IV-tandem BRCT domains (BRCTs) (Supplementary 
Figure SI). Several additional residues located between 
157 and 200 were also implicated in DNA binding 
(Supplementary Table SI). Like XRCC4, DNA binding 
of XLF resides in its tail region (20). This region of XLF 
has also been shown to mediate XRCC4-XLF parallel 
filament arrangements (25). Alanine substitution at K293 
decreased the overall DNA-binding capacity of XLF 
(Figure IB, lane 4), localizing XLF-DNA-binding 
activity to a highly conserved lysine cluster at the 
C-terminus, similar to reports for XLF's yeast homolog, 
Nejl (36) (Supplementary Figure S2). Additional 
DNA-binding sites within the head region of XLF have 
been identified by HDX (Hydrogen Deuterium Exchange) 
experiments (25); however, based on our studies with 
XLF 1-224 the head region alone does not support stable 
DNA binding (Figure 1). 

We next tested DNA-binding activity under conditions 
where XRCC4-XLF complexes are formed (20) and 
observed a super-shifted protein-DNA complex that 
migrated as a diffuse smear up into the well of the gel 
suggesting formation of a heterogeneous collection 
of large nucleoprotein complexes (Figure 1A, lane 7; 
Figure IB, lane 6). Interestingly, when XRCC4-DNA- 
binding mutants were present, only an intermediate 
gel-shift was observed (Figure 1A, lanes 8-10). This inter- 
mediate species migrated faster than the heterogeneous 
super-shifted species, but slower than XLF- or XRCC4- 
DNA complexes on their own. Similar to XLF- and 
XRCC4-DNA complexes, the intermediate species 
migrated as a discrete, homogeneous complex. Like the 
super-shifted complex, the intermediate complex is also 
dependent on the presence of both XRCC4 and XLF. 

Unlike XRCC4, XLF truncations and point mutants 
unable to bind DNA, in complex with wild-type 
XRCC4, produced either no shift (XLF 1-224 ), or an inter- 
mediate shift (XLF-K293A) (Figure IB) indicating that 
the DNA-binding activity of XLF but not XRCC4 is ne- 
cessary for XRCC4-XLF-DNA association. Complex 
observed in lane 8 is likely due to a partial 
DNA-binding defect of XLF-K293A (Figure IB, lane 3 
versus 4). DNA binding in the C-terminal region of XLF 
is clearly more important for XRCC4-XLF complex- 
DNA interactions than that of XRCC4 (Figure IB, lane 
7 versus Figure 1A, lane 8). Disruption of DNA binding 
by C-terminal deletion of XLF fully ablates XRCC4 XLF 
complex-DNA association, while XRCC4-DNA binding 
within the complex is dispensable, as the intermediate 
species is still observed. Furthermore, previously identified 
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Figure 1. XRCC4-XLF DNA binding. (A) XRCC4 wild-type (WT) 
and mutants (8 uM) were incubated with 100 ng of DNA with or 
without 2uM XLF (WT and mutants), analyzed by EMSA. (B) WT 
XLF and its mutants (2u.M) were incubated with lOOng of DNA with 
or without 8 uM XRCC4 (WT and mutants). (C) Effect of DNA Ligase 
IV tandem BRCT domains on XRCC4-XLF-DNA complex forma- 
tion. XRCC4 (8uM) and XLF (2uM) were incubated with 100 ng 
DNA fragments in the presence of increasing amounts of DNA 
Ligase IV tandem BRCT domains (BRCTs, 0, 0.5, 1, 2, 4-8 uM). 



mutations (20) that prevent XRCC4-XLF interaction 
fail to bind DNA beyond individual wild-type pro- 
teins, indicating XRCC4-XLF complexes are required 
to form both intermediate and super-shifted species 
(Supplementary Figure S3). 
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Figure 2. Bridging of DNA molecules by XRCC4-XLF. (A) Schematic of DNA-bridging assay. Proteins were incubated with magnetic beads linked 
to 1000-bp DNA and free 500-bp DNA. Beads were separated from supernatant and analyzed separately for presence of the 500-bp DNA. (B) an 
amount of 200 ng each of 1000- and 500-bp DNA fragments were incubated with XRCC4 (2uM), XLF (2uM) or DNA Ligase IV tandem BRCT 
domains (BRCTs, 2uM). Top panel shows the analysis of the protein-DNA complexes in the supernatants. Bottom panel shows the recovery of 
DNA species on the beads. L = 1 kb DNA ladder (NEB). (C) Bridging assays performed as in (B) with mutants preventing XRCC4-XLF filament 
formation. (D) Bridging assays performed as in (B). XRCC4 1-157 and XLF 1- 2 are truncated proteins lacking C-terminal tails and DNA-binding 
activity. 



DNA Ligase IV and DNA-binding regions of XRCC4 
modulate XRCC4-XLF complex-DNA interactions 

Residues 155-195 of XRCC4 interact with DNA Ligase 
IV BRCTs, including direct contact with R192 (37). Here 
we show that R192 and other residues within this region 
also mediate interaction with DNA (Supplementary Table 
SI) (38). Accordingly, simultaneous incubation with 
BRCTs precluded XRCC4s interaction with DNA 
(Figure 1C, lane 12) suggesting these binding events are 
indeed mutually exclusive. Consistent with this observa- 
tion, the super-shifted species was converted to the 



intermediate species in the presence of equimolar concen- 
trations of the tandem BRCT domains of DNA Ligase IV 
(Figure 1C, lanes 7-9). The intermediate species is not 
disrupted, however, even at the highest concentrations of 
BRCTs tested, and is reminiscent of the complex observed 
in Figure 1 A and B with wild-type XLF and XRCC4 trun- 
cations and point mutants unable to bind DNA. The ob- 
servation of multiple nucleoprotein complexes 
(super-shifted and intermediate) suggests that XRCC4- 
XLF complexes can associate with DNA in at least two 
distinct ways reflecting changes in shape and/or number of 
XRCC4 XLF complexes and DNA. 
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XRCC4-XLF complexes bridge DNA molecules 

During DSB repair, DNA ends are maintained in close 
proximity by higher order nucleoprotein complexes (39). 
The finding that XRCC4-XLF complexes interact with 
DNA, suggests these structures may be important for 
bridging DNA during repair. To test this hypothesis, we 
followed the experimental design outlined in Figure 2A. 
If XRCC4-XLF bridges the 1000- and 500-bp DNA, then 
the 500-bp DNA will be recovered from streptavidin- 
coated beads. XLF and XRCC4 on their own were 
unable to bridge DNA (Figure 2B, lanes 2-3). However, 
in the presence of both XRCC4 and XLF, 500-bp DNA 
was efficiently recovered from the beads, reflecting the 
ability of XRCC4 XLF complexes to bridge DNA mol- 
ecules (Figure 2B, lane 4). XLF and XRCC4 mutants that 
abolish XRCC4-XLF interaction did not support DNA 
bridging, suggesting that XRCC4-XLF complex mediates 
bridging activity (Figure 2C). 

Similar to the effect seen in DNA binding, DNA 
bridging by XRCC4-XLF was also disrupted by the 
presence of BRCTs (Figure 2B, lane 5). Since the inter- 
mediate species was still present in the supernatant when 
bridging was abolished, it appears that formation of the 
heterogeneous, super-shifted species is required for DNA 
bridging. One would further expect disruption of DNA 
bridging by other factors that alter the C-terminal tail of 
XRCC4 such as XRCC4 truncations and point mutations 
unable to bind DNA, or phosphorylation by DNA-PK, as 
illustrated by Roy et al. (52). 

As expected, complex-dependent DNA bridging was 
abolished when an XRCC4 C-terminal truncation 
unable to bind DNA was assayed for bridging capabilities 
(Figure 2D). Any truncation of either XLF or XRCC4 
that fully disrupts DNA binding also prevented DNA 
bridging (Figure 2D). In particular, reactions carried out 
with XRCC4 1 157 do not bridge DNA, but are able to sup- 
port complex-DNA binding, even though XRCC4 1 157 is 
unable to bind DNA alone (Figure 2D, lanes 5 and 9). 
Together these results demonstrate that XRCC4 tails are 
not required for complex-dependent DNA binding, yet are 
still necessary for DNA bridging, which requires forma- 
tion of a heterogeneous super-shifted nucleoprotein 
complex. 

Direct visualization of higher-order XRCC4-XLF 
nucleoprotein complexes 

Observation of multiple nucleoprotein complexes during 
DNA binding and bridging studies suggests that at least 
two distinct higher order protein-DNA complexes are 
formed by XRCC4-XLF. To further investigate this pos- 
sibility, we visualized such complexes using Scanning 
Force Microscopy (SFM), under conditions identical to 
DNA-binding and -bridging experiments. 

XRCC4 and XLF were imaged independently and 
appeared as monodisperse objects (data not shown). 
DNA alone displayed dimensions characteristic for this 
type of SFM imaging (1821 -bp DNA measured 
6000 x 180 x 2 A) (Figure 3C). When XRCC4 and XLF 
were incubated together in the absence of DNA, mono- 
disperse objects were still observed (Figure 3D, small blue 



objects), however additional, larger, irregular species 
(Figure 3D, green arrows, 30-60 A high and 400-800 A 
wide) and elongated objects (Figure 3D, yellow arrow, 
~ 1300 A long, 20-25 A high and 300 A wide) were also 
observed. Addition of DNA to XRCC4-XLF generated 
large protein-DNA networks (Figure 3A and B). DNA 
molecules were identified based on similar width and 
height to the DNA only control (Figure 3C). XRCC4- 
XLF complexes arranged DNA in two distinct ways. 
First, multiple DNAs were aligned end-to-end as indicated 
by complexes that were several fold longer than the length 
of the individual 1.8-kb DNA fragments (compare DNA 
in Figure 3C and A). In addition, DNA molecules were 
found aligned in parallel with an even spacing of ~200 A 
(Figure 3A and B, black circles). Given the regularity and 
extensive occurrence of these parallel DNA arrangements 
it is likely they reflect specialized repair complexes. Each 
of these DNA arrangements appear connected by 
XRCC4-XLF complexes that appear as objects higher 
and wider than DNA alone (Figure 3A and B, green 
and black arrows). The largest of these XRCC4-XLF 
complexes were ~30-60A high and 400-800 A wide 
(Figure 3A and B, green arrows) while the smaller 
XRCC4 XLF complexes were 4-7 A high and 110— 
180 A wide (Figure 3B, black arrows). Either or both of 
these DNA-protein (DNA end-to-end or parallel) ar- 
rangements may be essential for the DNA-bridging 
activity observed in Figure 2. 

Consistent with results from bridging assays (Figure 
2B), addition of BRCTs fully disrupted the bridged 
DNA networks observed via SFM in Figure 3A, without 
ablating protein-DNA associations (Figure 3E, green and 
black arrows). The majority of protein-DNA complexes 
observed in the presence of BRCT domains measured ~4- 
7 A high and 1 10-180 A wide (black arrows), making them 
roughly equivalent in size to the smaller nucleoprotein 
complexes observed in the absence of BRCT domains 
(Figure 3B, black arrows). Larger complexes observed in 
the presence of BRCT domains (Figure 3E, green arrow) 
remained similar in size to those indicated by green arrows 
in Figure 3A, B and D. 

Structure of the XRCC4 1157 -XLF I_224 complex 

To further investigate the structural relationship(s) of 
XRCC4-XLF, we crystallized domains of these proteins 
responsible for complex formation. To facilitate crystal- 
lization, flexible C-terminal regions were removed from 
full-length XRCC4 (1-336) and XLF (1-299). XRCC4 1 
157 -XLF 1-224 crystals diffracted to ~3A o resolution; 
however, anisotropy limited the data to 3.94 A resolution. 
Phase information was obtained using a combination of 
molecular replacement and single-wavelength anomalous 
diffraction (MR-SAD). Representative electron density is 
provided in Supplementary Figure S4. XRCC4 1 " 57 - 
XLF 1-224 crystals grew in spacegroup C2 with one ex- 
tremely long axis (~750A) and contained six dimers of 
each protein in the asymmetric unit. The structure was 
refined to R and R tree values of 26.2 and 32.9, respectively 
(PDB 3RWR). Table 1 lists data collection and refinement 
statistics. 
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Figure 3. SFM analysis of protein-DNA networks. (A) Large protein-DNA networks observed in XRCC4-XLF DNA-binding reactions. 
Nucleoprotein complexes appear as higher wider objects (~30-60A high and 400-800 A wide, green arrows). Such complexes were typical and 
observed on all 16 of the 2x2 micron images collected for this sample. Protein-induced parallel bridging of DNA molecules is evident (black circles) 
and occurs in many places along the long complex. Extended DNA networks indicate DNA molecules are positioned end-to-end. White box indicates 
area enlarged in (B). (B) Enlargement of (A) highlighting parallel DNA molecules. Spacing between parallel DNA molecules is ~200A. Smaller 
nucleoprotein complexes, measuring 4-7 A high and 110-180 A wide, are also evident (Figure 3B, black arrows). (C) The 1.8-kb linear DNA, contour 
length 6000 A, width 180 A and height 2 A. (D) Image of XRCC4-XLF complexes in absence of DNA. Small protein complexes, likely dimers, 
tetramers and small multimers, appear as blue uniform objects distributed over the surface. Larger protein complexes (30-60 A high and 400-800 A 
wide) and elongated forms were s also present and are indicated by green and yellow arrows, respectively. Elongated structures (yellow arrow) 
measuring ~ 1300 A long, 20-25 A high and 300 A wide, were observed with a frequency of approximately one for every 2x2 micron image 
inspected. (E) Addition of DNA Ligase IV BRCT domains disrupts nucleoprotein networks. Smaller protein complexes similar to (D) are uniformly 
distributed and appear as small blue objects. DNA molecules associated with protein complexes are also observed (25-60 A high and 400-800 A wide, 
green arrow) with a frequency of approximately one for every 2x2 micron image inspected. Smaller nucleoprotein complexes (4-7 A high and 1 10— 
180 A wide, black arrows) were observed at higher frequency. (A) is 2 x 2 microns. (B) is 500 x 500 nm. (C-E) are 1 x 1 microns. In all images the 
white bar is 2000 A long and height is indicated by color (0-3 nm red to yellow/white, scale bar in panel B). Note that biomolecule dimensions are 
distorted in SFM images. X-Y dimensions^ increase due to tip convolution and Z (height) decreases relative to tip surface and tip molecule 
interactions. For instance DNA, that is 20 A wide and 20 A high based on crystal structure of B-form, typically measures 200 A wide and 2-5 A 
high in our SFM images. Relative size and separation between objects can be used very accurately. 



All dimers of XRCC4 1157 and XLF 1-224 within the asym- 
metric unit were part of continuous filaments extending the 
length of the crystal. Each left-handed helical filament was 
composed of alternating XRCC4 1 157 and XLF 1 " 224 dimers 
with an ~30° offset (Figure 4A). The filament diameter is 
~220 A, with an internal pore of ~70 A, toward which both 
proteins' N-terminal heads are directed. For XRCC4 ~ 57 , 



the C-terminal tail region extends outward while the analo- 
gous region of XLF 1-224 is directed toward the inner pore 
(Figure 4B and C), however, since both of these tail regions 
are highly flexible it is possible that they may not continue 
in a linear trajectory. 

Closer inspection of the XRCC4 1 " 157 -XLF 1 " 224 struc- 
ture reveals formation of a (3-sheet between strands p5-7 
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of XRCC4 1157 and |J4' of XLF 1-224 (Figure 4D). In the 
absence of XRCC4 1157 , |34' forms a loop between a2 and 
oc3 of the helix-turn-helix in XLF 1 " 224 , indicating that 
small conformational changes result from complex 



Table 1. Crystallographic data and refinement statistics 



Data collection 



Wavelength (A) 


1.2536 


Spa.ce group 




Cell parameters (A) 


a = 745.4, b = 149 




a = y = ft = 


Molecules in A.U. 


24 


Resolution range (A) a 


50.0-3.9 (4.2-3.9) 


Unique reflections 


78630 


Data redundancy 11 


6.7 (6.6) 


Completeness (%) a 

//CT(/r „ 


97.6 (98.4) 


11.7 (2.1) 


Emerge (%) 


15.2 (96.3) 


Mosaicity 


0.35 


Wilson scaling B factor (A ) 


168.55 


Model and refinement 




Resolution range (A) 


50.0-3.94 


^work (%) 


27.0 


^free (%) 


32.6 


Reflections observed 


73 366 


Reflection test set 


3886 


Number of protein atoms 


3464 


RMSD bond lengths (A) 


0.0028 


RMSD bond angles (A) 


0.77 


Average B factor (A ) 


176.5 



"Statistics for the highest data resolution shell are shown in 
parentheses. 



formation. The XRCC4 1 157 -XLF'" 224 interface is 
further strengthened through direct amino acid inter- 
actions (Figure 4E). All amino acids involved in this inter- 
face are listed in Supplementary Table S2. L65 and LI 15 
of XLF make key contributions by sandwiching a highly 
conserved XRCC4 F106 (Supplementary Figure SI and 
S2). In addition, K99 of XRCC4 makes hydrophobic 
interactions with LI 15 of XLF, through its aliphatic 
chain and also forms a hydrogen bond with the carbonyl 
oxygen of SI 13 in XLF (Figure 4E). These interactions are 
consistent with recently reported mutational and struc- 
tural analyses (21,23-27). 



DISCUSSION 

The filamentous structure of XRCC4-XLF observed here 
is consistent with recently reported structures of these 
proteins (24-26). Despite these structures having been 
determined under different conditions, with differently 
sized protein truncations and in different crystal packing 
environments, little difference is observed in the inter- 
action interface between XRCC4 and XLF. Combined 
with supportive biochemical analysis, mounting structural 
data strongly suggests that the interface observed here 
reflects a biologically relevant XRCC4-XLF complex. 
Despite these similarities, interesting differences were 
observed in the relative helical pitch associated with dif- 
ferent filament structures (720, 760, 840 and 860 A) 
(24-26). The ability of XRCC4 XLF filaments to 




Figure 4. Crystal structure of the XRCC4 1 " 157 -XLF 1 " 224 complex. (A) Interaction of XLF 1 " 224 homodimer (orange) with an adjacent XRCC4 1157 
homodimer (blue) as seen in the crystal structure. Black arrow illustrates -30° offset between homodimers. (B) XRCC4 1I57 -XLF 1-224 filament. 
XRCC4 1- ' 57 homodimers are numbered. Colored arrows indicate directionality of XRCC4 1 " 157 (blue) and XLF 1 " 224 (orange) C-ter minal tails. 
Filament diameter is indicated in A. (C) XRCC4 1157 -XLF 1224 filament, rotated 90° clockwise from (B). Length of one filament revolution is 
indicated in A. (D) XRCC4 1 " 157 -XLF 1 " 224 head-to-head interface. (E) Key amino acids directly involved in XRCC4 1_157 -XLF 1 - 224 head-to-head 
interaction. 
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accommodate changes in pitch indicates these structures 
possess elastic potential that one would expect for 
scaffolding molecules associated with bridging DNA. 
In addition, altered pitch results in variable internal pore 
sizes that could accommodate other molecules ranging in 
size from dsDNA to an entire nucleosome (24-26). 
Precedence for functional protein filaments in DNA 
repair is seen in NHEJ's counterpart, Homologous 
Recombination (HR). HR utilizes a nucleoprotein 
filament conserved in humans (RAD51), bacteria (RecA) 
and archaea (RadA)(4(M2). RAD51 RadA forms 
protein filaments through a P-strand polymerization 
motif, where the P-strand of one RAD51-RadA protein 
binds to the P-sheet of an adjacent RAD51-RadA 
monomer (43,44). In a similar manner, RecA forms a con- 
tinuous P-sheet across monomers within its growing 
filament. As an isolated protein, however, the interacting 
P-strand exists as a flexible loop (45). This is identical to 
what occurs in the loop region of XLF, when in contact 
with XRCC4 (Figure 4D, 04'). Furthermore, RadA, like 
XRCC4-XLF, also forms a left-handed protein helix to 
bind DNA (43). The crystal structure of DNA-bound 
RecA further shows that RecA uses an extended 
filament structure to wrap around DNA (46). 

DNA binding by XRCC4 and XLF is reported to occur 
in a highly cooperative manner (27), suggesting a protein 



filament may form along DNA similar to what is observed 
with RecA or RAD51. In our structure, the DNA-binding 
region of XLF is situated toward the inner helical pore 
(Figure 4B), which has sufficient room (70 A) to accom- 
modate both DNA and the XLF DNA binding domain 
(residues 224-299) suggesting that XRCC4-XLF may 
wrap around DNA. However, it is equally possible for 
DNA to be wrapped around a filament as suggested by 
Hammel et al. (24). This mode of interaction is remark- 
ably similar to how histone-like nucleoid structuring 
protein, H-NS, has been observed to bind DNA (47). 

Results from DNA-binding studies presented here 
indicate two modes of protein-DNA interaction (an inter- 
mediate and heterogeneous super-shifted species), suggest- 
ing nucleoprotein filaments exist in more than one 
structural state. Of these, only the super-shifting species 
correlated with DNA bridging. XRCC4-XLF nucleopro- 
tein complexes were further characterized via SFM and 
shown to interact with DNA in two general arrangements. 
The larger protein complexes observed in Figure 3 (green 
arrow) greatly exceed the size of a single XRCC4-XLF 
filament, consistent with multiple filament bundles 
observed recently both crystallographically and by SAXs 
analysis (24-26). Further support for this arrangement 
comes from direct visualization by transmission electron 
microscopy (24). In this way, DNA could be fully coated 
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Figure 5. Model of XRCC4-XLF filaments bound to DNA. (A) Multiple adjacent filaments bound to DNA (yellow). Each color is a separate 
filament. DNA runs through the pore of the XRCC4-XLF filament. (B) Tails of an XRCC4 1-157 homodimer (blue) point toward the N-terminus of 
XLF 1-224 ( orange)i 

in an adjacent filament. (C) XRCC4 dimers associate into a tetramer through C-terminal tails (PDB 1FU1). DNA-bridging 
models illustrating single and complex filaments. Two DNA molecules coated in a simple or multi-filament bundle are bridged through XRCC4 
C-terminal tails. 
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Figure 6. Summary of the structural states of XRCC4. Structural states of XRCC4 are indicated with their associated function (PDB 1FU1 and 
PDB 3116; 21,23,37,38). 



by multiple adjacent filaments (Figure 5A). Evidence for 
this arrangement is also observed in the crystal structure 
presented here and elsewhere (26). In particular, when 
modeled within the context of a filament bundle, the 
C-terminal tails of some XRCC4 dimers are directed 
toward the head domains of XLF in an adjacent 
filament (Figure 5B). This arrangement may explain why 
XRCC4 tails are required for formation of the larger 
nucleoprotein complexes capable of bridging DNA 
(Figures 2 and 3). Alteration of these tails through 
mutation, binding of BRCTs as observed in a previous 
structure with XRCC4 (37) or DNA-PK phosphorylation 
[Roy et al. (52)] could disrupt interactions between 
adjacent filaments resulting in smaller, less complex nu- 
cleoprotein assemblies, perhaps corresponding to the 
intermediate species observed in Figures 1 and 2, which 
are unable to bridge DNA. Given the structural informa- 
tion currently available (24-26) it is not obvious how 
BRCT binding to XRCC4 alone would disrupt single or 
multiple XRCC4-XLF filament(s). Current structures of 
XRCC4-XLF complexes lack the BRCT-binding region 
of XRCC4 making it difficult to assess a precise mechan- 
ism for BRCT-mediated disruption of the XRCC4-XLF 
bridging complex. However, results from SFM suggest 
alignment of DNA molecules both end-to-end and 
parallel to one another, which may account for disruption 
of XRCC4 XLF by BRCT domains. Interestingly, the 
same tail region of XRCC4 that is required for DNA 
bridging also corresponds to its homo-tetramerization 
region (38). It is therefore possible, that through homo- 
tetramerization, adjacent filament bundles bound to DNA 
could be bridged (Figure 5C). The spacing observed 
between parallel DNA molecules in SFM (~200A) 



correlates well with those predicted in this model (Figure 
5C). Since tetramerization and BRCT binding are 
mutually exclusive, addition of equimolar amounts of 
BRCT domains would be expected to disrupt all inter- 
actions dependent on XRCC4 tails, as was observed via 
SFM (Figure 3). It is important to note that the data pre- 
sented here and resulting model are both compatible with 
DNA being present on the interior or exterior of XRCC4- 
XLF filaments, and perhaps even on the interior and the 
exterior at the same time. The model for XRCC4-XLF 
complexes presented in Figure 5 suggests that the filament 
observed within the asymmetric unit of the crystal struc- 
ture presented here reflects a biologically relevant struc- 
ture. At this time, however, no direct evidence exists to 
support filament formation in vivo. 

DNA-bridging activity of XRCC4 XLF is consistent 
with a role in stabilizing DNA ends during NHEJ. 
Interestingly, binding of DNA Ligase IV apparently 
causes these bridging-competent complexes to be 
remodelled such that they no longer stably bridge DNA. 
If XRCC4 is available in excess over DNA Ligase IV, 
both types of complexes (XRCC4-XLF and XRCC4- 
Ligase IV) should be able to exist simultaneously. 
Furthermore, at substoichiometric concentrations of 
DNA Ligase IV (BRCT domains) XRCC4-XLF 
super-shifted complexes remain intact and are able to 
bridge DNA (Figure 1 and data not shown). Together 
this suggests that XRCC4-XLF multifilament complexes 
may persist when bound by a limited number of DNA 
Ligase IV molecules. 

Stable bridging of DNA ends by XRCC4 XLF 
complexes is further supported by recent in vivo analysis 
suggesting XRCC4-XLF complex assembly is necessary 
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for repair [Roy et al. (52)]. In NHEJ, Ku and DNA-PKcs 
have been implicated in synapsis of DNA ends (48). 
However recent studies show rapid, DNA-PKcs- 
independent recruitment of XLF to damaged sites by 
Ku that is further stabilized at DNA breaks by the recruit- 
ment of XRCC4 suggesting XRCC4 XLF plays an early 
role in repair (18,19). Collectively these results imply that 
Ku may initially recruit XLF to the damaged site and 
nucleate XRCC4-XLF-bridging complex formation 
along DNA (18,19,24-26). Regulation of this DNA- 
bridging complex (Figures 2 and 3) occurs via modifica- 
tions to the C-terminus of XRCC4 (summarized in 
Figure 6). Interestingly, Roy et al. further demonstrate 
that DNA-PK hyper-phosphorylation within the same 
region of XRCC4 also modulates DNA-bridging activity 
(52). There are now several proteins reported to bridge 
DNA ends (Ku, ATM and MRN) (49-51). It will be inter- 
esting to determine how each of these proteins contribute 
to repair via their bridging activity of various types of 
DNA damage in vivo. 

Together this work presents new insight into the mech- 
anism for how XRCC4-XLF complexes function in 
NHEJ. The ability of XRCC4-XLF to stably bridge 
DNA ends is consistent with previous observations and 
provides a mechanistic basis for how XLF stimulates 
XRCC4-DNA Ligase IV non-cohesive end joining 
(8-10). How these structures assemble with one another 
and other NHEJ factors for repair in vivo will require 
further investigation. 
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